Four-Ary Tree-Based Barrier Synchronization for 2D Meshes without Nonmember Involvement
Abstract
This paper proposes a Barrier Tree for Meshes (BTM) to minimize the barrier synchronization latency for two-dimensional (2D) meshes. The proposed BTM scheme has two distinguishing features. First, the synchronization tree is 4-ary. The synchronization latency of the BTM scheme is asymptotically $\log_4 n$, while that of the fastest scheme reported in the literature is bounded between $\log_3 n$ and $O(n^{1/2})$, where $n$ is the number of member nodes. Second, nonmember nodes are neither involved in the construction of a BTM nor do they actively participate in the synchronization operations, which avoids interference among different process groups during synchronization. This not only results in low setup overhead, but also reduces the synchronization latency. The low setup overhead is particularly effective for the dynamic process model provided in MPI-2. An extensive simulation study shows that, for meshes of up to 64 × 64 nodes, the BTM scheme results in about 40 to 70 percent shorter synchronization latency and is more scalable than conventional schemes. Index Terms: barrier synchronization, hardware-supported barriers, communication latency, wormhole routing, MPI.
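As a rough illustration of the growth terms quoted in the abstract, the sketch below evaluates log base 4, log base 3, and the square root of n for square meshes in which every node is a member. It is not the BTM algorithm and does not model latency: constant factors, routing, and setup costs are ignored, and the helper names `tree_depth` and `compare_bounds` are hypothetical.

```python
import math


def tree_depth(n: int, arity: int) -> int:
    """Smallest d with arity**d >= n, i.e. ceil(log_arity(n)) in integer arithmetic."""
    depth, reach = 0, 1
    while reach < n:
        reach *= arity
        depth += 1
    return depth


def compare_bounds(n: int) -> dict:
    """Growth terms from the abstract, with constant factors ignored (assumption)."""
    return {
        "log4_n": tree_depth(n, 4),
        "log3_n": tree_depth(n, 3),
        "sqrt_n": math.isqrt(n),  # floor of n**0.5
    }


if __name__ == "__main__":
    for side in (8, 16, 32, 64):
        n = side * side  # assumption: all nodes of a side x side mesh are members
        print(side, compare_bounds(n))
```

For a 64 × 64 mesh (n = 4096) the three terms come to 6, 8, and 64, which shows how far apart the logarithmic and square-root bounds sit at that size.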
Similar resources
Fast Barrier Synchronization in Wormhole k-ary n-cube Networks with Multidestination Worms
This paper presents a new approach to implement fast barrier synchronization in wormhole k-ary n-cubes. The novelty lies in using multidestination messages instead of the traditional single destination messages. Two different multidestination worm types, gather and broadcasting, are introduced to implement the report and wake-up phases of barrier synchronization, respectively. Algorithms for co...
Fast Barrier Synchronization in Wormhole k-ary n-cube Networks with Multidestination Worms
Proc. of the Int'l Symposium on High Performance Computer Architecture (HPCA '95), pp. 200-209. This paper presents a new approach to implement fast barrier synchronization in wormhole k-ary n-cubes. The novelty lies in using multidestination messages instead of the traditional single destination messages. Two different multidestination worm types, gather and broadcasting, are introduced to impl...
Global Reduction in Wormhole k-ary n-cube Networks with Multidestination Exchange Worms
This paper presents a new approach to implement global reduction operations (including barrier synchronization) in wormhole k-ary n-cubes. The novelty lies in using a multidestination message-passing mechanism instead of single destination (unicast) messages. Using pairwise exchange worms along each dimension, it is shown that global reduction and barrier synchronization operations, as defined by...
Barrier Synchronization on Wormhole-Routed Networks
In this paper, we propose an efficient barrier synchronization scheme on networks with arbitrary topologies. We first present a distributed method for building a barrier routing tree. The barrier messages can be delivered adaptively according to the hierarchy of the established barrier tree to avoid congestion and faulty nodes in the network. We then propose a new technique, called bandwidth-pre...
Optimal Phase Barrier Synchronization in k-ary n-cube Wormhole-Routed Systems Using Multirendezvous Primitives
We propose a new multiphase barrier synchronization algorithm for k-ary n-cube networks using our earlier developed multirendezvous synchronization primitives [4]. The algorithm incurs minimal cost with an optimal number of communication phases. The optimality is guided by system configuration parameters $(k, n)$ and technological parameters $t_s$ (communication start-up cost) and $t_p$ (link-propagation ...
Journal title:
- IEEE Trans. Computers
Volume: 50
Issue:
Pages: -
Publication date: 2001